CDS

Accession Number TCMCG021C21325
gbkey CDS
Protein Id XP_010933638.1
Location join(22811286..22811357,22814854..22815057,22815129..22815164,22815259..22815371,22826378..22826423,22831573..22831619,22831781..22831888,22835263..22835338,22836661..22836733,22836810..22836955,22837249..22837334,22839580..22839719,22839785..22839924,22840002..22840052,22840824..22840887,22842920..22843063,22843155..22843222,22843327..22843385,22843546..22843702,22852968..22853020,22853108..22853206,22853653..22853767,22853842..22853964,22854345..22854515)
Gene LOC105053980
GeneID 105053980
Organism Elaeis guineensis
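
The Location field above lists the CDS as a join of 1-based, inclusive genomic intervals. As a minimal sketch (not part of the original record), the spliced length can be recomputed from those intervals and compared with the stated protein length. The helper names `parse_join` and `spliced_length` are hypothetical, and the parser assumes a plain forward-strand `join(...)` with no `complement(...)` wrapper or partial markers (`<`, `>`).

```python
import re

# Location field copied from this record (1-based, inclusive coordinates).
LOCATION = (
    "join(22811286..22811357,22814854..22815057,22815129..22815164,"
    "22815259..22815371,22826378..22826423,22831573..22831619,"
    "22831781..22831888,22835263..22835338,22836661..22836733,"
    "22836810..22836955,22837249..22837334,22839580..22839719,"
    "22839785..22839924,22840002..22840052,22840824..22840887,"
    "22842920..22843063,22843155..22843222,22843327..22843385,"
    "22843546..22843702,22852968..22853020,22853108..22853206,"
    "22853653..22853767,22853842..22853964,22854345..22854515)"
)


def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs from a simple join(a..b,c..d,...) string.

    Assumption: no complement(), no partial markers, no single-base segments.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments: list[tuple[int, int]]) -> int:
    """Sum the inclusive lengths of all segments."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total_nt = spliced_length(segments)

print(len(segments), "segments,", total_nt, "nt")
# A 796 aa protein needs 796 codons plus one stop codon.
print("consistent with 796 aa + stop:", total_nt == 3 * (796 + 1))
```

The intervals sum to 2391 nt, which equals 3 x 797 and is therefore consistent with a 796 aa product followed by a stop codon.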

Protein

Length 796 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268357
db_source XM_010935336.3
Definition DNA mismatch repair protein MSH4 [Elaeis guineensis]

EGGNOG-MAPPER Annotation

COG_category L
Description DNA mismatch repair protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03400
KEGG_ko ko:K08740
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGAAGACGGAGGCGGTGGAGGAGATAGATCCAGTTTCGTCATCGGATTGATCGAGAACAGAGCAAAGGAGGTTGGTGTGGCTGCTTTTGACTTGCGATCTGCTTCACTGGATCTTTCTCAATACATTGAAACAAGTTGTTCATATCAAAACACAAGAACCCTCCTGCATTTTTATGATCCTATGGTCATAATTGTTCCCCCAATCAAACTGGCACCAGATAGTATGGCAGGAGTATCAGAACTGGTAGATAAACATTATACATCACACAAAAAGATTACAATATCCCGTGGCTTCTTTGATGATACAAAGGGGGCCATGCTGGTTAAAAATCTGGCAGCTAAGGAGCCATCTGCTCTTGGTTTGGATACCTATTATAAGCAATATTACCTCTGCCTAGCTGCTGCTGCTGCTACAATCAAATGGACTGAAGCAGAGAAAGGTGTAATTGTTACAAATCACTCATTGTCGGTTACATTCAATGGCTCATTTGATCACATGAATATTGATGCGACCAGTGTCCAAAATTTGGAAATTATTGACCCATTGCATTCTGAACTATATGGCATTGGCAACAAGAAGAGAAGTCTATTCCAGATGTTGAAGACTACGAAGACTGTTGGAGGGACTAGACTACTTCGTGCCAACTTGTTGCAACCTTTGAAAGACATGGAAACTATCAATGCTCGCCTGGACTGTTTAGATGAGTTAATGAGCAATGAGGAGCTATTTTTTGGTTTGTCACAGGGGCTTCGTAAATTTCCAAAAGAAACAGATAAGGTACTCTGTCACTTCTGCTTTAAGCCCAAAAAAATCACAGAGGAGGTTTTGAGGCCTGCCAATGGTAGAAAGAGTCAAATGTTGATATCAGACATTATTATTCTCAAAACTGCTTTAGATGCCTTACCTTTTCTCTCAAAGGTACTTAAGGATGCAAAAAGTTTTCTTCTTTCCAACATTTACAAGACTGTTTGTCAAAATGAAAAATATGCAGGCATAAGAAAGAGAATTGGTGATGTGATTGATGAAGATGTAGTGCATGCAAGGGCTCCTTTTGTTGCCTGCACACAACAGTGTTTCGCTATTAAGGCTGGAATTGATGGACTTCTGGATGTTGCACGACGCTCTTTCTGTGATATGAGTGAAGCAATACATAATCTTGCAAACAAATACCGGGAGGAATTTAAGCTGCCAAATTTGAAAATTCCCTATAACAATAGGCAAGGATTTTACTTCAGTATTCCACTAAGGGATGTAAATGGAAAGCTTCCCGACAAATTTATTCAGGTCATGAAACATGGGAAGAACATGCACTGCTCAAGTTTTGAACTTGCATCTCTAAACGTGAGGAATAAGTCAGCTGCTGCTGAATGTTTTATCCGGACGGAACTTTGCTTGGAAGGGCTAATTGATGTCATAAGGGAGGATGTCCCCATACTAAGACTGCTTGCAGAGGTCTTATGCCTTCTAGACATGATTGTGAATTCATTTGCACATACAATATCCAGTAAACCAGTTGATCGCTATACAAGACCAGAGTTCACTGATAATGGTCCTATGGCAATTGATGCTGGAAGGCACCCTATTCTAGAAGGTTTACACAATGATTTTGTTCCTAACAATCTTTTTTTTTCTGAAGCATCTAATATGGTGATTGTAATGGGCCCAAACATGAGTGGGAAAAGCACTTATCTTCAACAAGTTTGTCTGATAGTCATTCTTGCACAAATTGGTTGTTACATTCCTGCTCGTTTTGCATCTCTAAGGGTGGTTGATCGCATATTTACACGGATGGGGACTGGGGACAACGTTGAACACAACTCAAGCACTTTTATGACTGAAATGAAAGAGACAGCTTTCGTCATGCAAAATGTTTCTCCCAAGAGCCTGATAGTTATGGATGAACTTGGAAGGGCTACTTCATCCTCTGATGGATTTGCAATTGCATGGAGCTGTTGTGAACATTTGTTATCTCTGAAGGCCTACACTATATTTGCTAC
ACATATGGAAGACCTGTCTGAACTGGCAACCATTTATCCGAATGTGAAGATTCTCCATTTTGAAGTTGACTTGAGGAACAACCGCTTAGATTTCAAGTTTCATCTCAAAGATGGGCCACGACATGTGCCGCACTATGGTCTTTTATTGGCTGGAGTTGCGGGTTTACCAAGTTCTGTGATTGACGCGGCAAGGAATATCACTGTAAGGATCACAGAAGAGGAAGTGAAGAGGATGGACATCAACTGTGAGGAACACCATTCCATTCAAATGGTATACGGGGTTGCACAAAAACTGATTTGCTTGAAGTATTCAAACCAAGGAGAGGATTATATTCGGCAAGCTATACAGAATCTCAAGGAGGGCTTCAAGGAGGGCAGGTTAATATGTTGA
Protein:  
MEDGGGGGDRSSFVIGLIENRAKEVGVAAFDLRSASLDLSQYIETSCSYQNTRTLLHFYDPMVIIVPPIKLAPDSMAGVSELVDKHYTSHKKITISRGFFDDTKGAMLVKNLAAKEPSALGLDTYYKQYYLCLAAAAATIKWTEAEKGVIVTNHSLSVTFNGSFDHMNIDATSVQNLEIIDPLHSELYGIGNKKRSLFQMLKTTKTVGGTRLLRANLLQPLKDMETINARLDCLDELMSNEELFFGLSQGLRKFPKETDKVLCHFCFKPKKITEEVLRPANGRKSQMLISDIIILKTALDALPFLSKVLKDAKSFLLSNIYKTVCQNEKYAGIRKRIGDVIDEDVVHARAPFVACTQQCFAIKAGIDGLLDVARRSFCDMSEAIHNLANKYREEFKLPNLKIPYNNRQGFYFSIPLRDVNGKLPDKFIQVMKHGKNMHCSSFELASLNVRNKSAAAECFIRTELCLEGLIDVIREDVPILRLLAEVLCLLDMIVNSFAHTISSKPVDRYTRPEFTDNGPMAIDAGRHPILEGLHNDFVPNNLFFSEASNMVIVMGPNMSGKSTYLQQVCLIVILAQIGCYIPARFASLRVVDRIFTRMGTGDNVEHNSSTFMTEMKETAFVMQNVSPKSLIVMDELGRATSSSDGFAIAWSCCEHLLSLKAYTIFATHMEDLSELATIYPNVKILHFEVDLRNNRLDFKFHLKDGPRHVPHYGLLLAGVAGLPSSVIDAARNITVRITEEEVKRMDINCEEHHSIQMVYGVAQKLICLKYSNQGEDYIRQAIQNLKEGFKEGRLIC
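
The CDS and protein sequences above can be cross-checked by translating the nucleotide sequence with the standard genetic code. The following is a minimal sketch under that assumption (the record does not state a translation table); `translate` and `CODON_TABLE` are hypothetical names, and only the first 30 and last 18 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code, written in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop, 'X' an unknown codon."""
    # Remove whitespace so a sequence wrapped over several lines still works.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 30 nt and last 18 nt of the CDS shown in this record.
head = translate("ATGGAAGACGGAGGCGGTGGAGGAGATAGA")
tail = translate("GGCAGGTTAATATGTTGA")

print(head)  # MEDGGGGGDR
print(tail)  # GRLIC*
```

Both fragments agree with the start (`MEDGGGGGDR`) and end (`GRLIC`) of the listed protein. Passing the full CDS to `translate` and stripping the trailing `*` would allow a comparison against the whole 796 aa sequence.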